Detecting degree symmetries in networks 
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The surrounding of a vertex in a network can be more or less symmetric. We derive measures of a specific kind 
of symmetry of a vertex which we call degree symmetry — the property that many paths going out from a vertex 
have overlapping degree sequences. These measures are evaluated on artificial and real networks. Specifically 
we consider vertices in the human metabolic network. We also measure the average degree-symmetry coefficient 
for different classes of real-world network. We find that most studied examples are weakly positively degree- 
symmetric. The exceptions are an airport network (having a negative degree-symmetry coefficient) and one-mode 
projections of social affiliation networks that are rather strongly degree-symmetric. 
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I. INTRODUCTION 

With the advent of modern database technology numerous 
large scale network data-sets have been made available. This 
development has triggered a surge of activity in studies of sta- 
tistical network properties jlj |5( I21I) . The underlying idea 
of these studies is that the network structure (the way the net- 
works differ from completely random networks) contain some 
information of the function, both locally and globally, of the 
network. Hence a common theme in these works has been the 
development of structural measures to characterize network 
structure. In this paper we propose and evaluate a measure 
of a previously unstudied network structure — a special case 
of symmetry we call degree symmetry. In geometry an object 
is symmetrical if it is invariant to rotations, reflections, and 
so on. In networks, with no given geometrical embedding, 
these concepts have to be relaxed. Furthermore, we would 
like to have a continuous measure saying not only if a vertex 
is a local center of symmetry or not, but also how symmetric 
the vertex is. The aspect of symmetry we address is, roughly 
speaking, that if you look at the object (network in our case) 
in different ways from a symmetric vertex it still looks the 
same. We process of "looking" will in our case be walking 
along paths (non-self intersecting sequences of edges). Fur- 
thermore, since degree (number of neighbors) is commonly 
regarded as the most fundamental quantity relating a vertex to 
its function, we say two vertices "look the same" if they have 
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FIG. 1 Illustrations of degree symmetry. Consider paths of length 
two (i.e. / = 2). All paths out from the central (black) vertex have 
the degree sequence (3, 2) meaning the central vertex has high degree 
symmetry. 



the same degree. We will thus derive our measure by perform- 
ing walks along all paths from a vertex and compare the se- 
quence of degrees of the vertices along these paths. The situa- 
tion we have in mind is depicted in Fig.^ — a H paths from the 
central vertex have degree sequences starting with (3,2, • ••), 
thus the central vertex is highly degree symmetric. 

The rest of the paper is organized as follows: First we give a 
detailed derivation of the degree-symmetry coefficient (in two 
different versions, appropriate for different needs). Then we 
evaluate these on example networks and a biochemical net- 
work. Finally we discuss the average degree symmetry of dif- 
ferent classes of real-world networks. 



II. DERIVATION OF THE MEASURE 

We will consider the network represented by a graph G - 
(V, E) of N vertices, V, and M edges, E. For a vertex i to 
have high degree symmetry it has, as mentioned, to have many 
paths with the same sequence of degrees. We will use a cut-off 
I for the pathlength and consider only paths of that length. The 
reason for this cutoff is threefold: First, in all (with possibly 
some curious exception) network processes, a vertex is more 
affected by its closest surroundings then vertices further away. 
Thus one would like to have a lower weight on the contribu- 
tion from distant vertices. Second, the number of vertices n 
steps away grows fast with the distance from i. For finite net- 
works this means that the paths soon reach the periphery of 
the network where unwanted finite-size effects set in. Third, 
for computation speed, one benefit from a cutoff. 

Assume there are p paths of length I from a vertex i. We 
then denote the degree sequences of these paths 



6/(o = [[k(vy,---,k(v l w )i 



(l) 



where k( v) denotes the degree of a vertex v and v 1 ■ , is the /th 
vertex of along the m'th path of length I leading out from i. 
Then if there are unexpectedly many vertices at the same (j-) 
index in the sequence with the same degree, the vertex i is a 
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FIG. 2 Illustrations of of concepts in the derivation of the degree 
symmetry coefficient, (a) illustrates the branching number. Con- 
sider paths of length three out from i. The branching number of the 
path (i, j) is five (there are five paths from i of length three that goes 
through j). The branching number at / is two. (b) shows the set 
A(P, i), where P is the path (i, j, /). 



local center of degree symmetry. A rough symmetry measure 
would thus be to count the fraction of index-pairs with the 
same degree, i.e. 



0<n<n'<p 7=1 



where 



A = (7-1) 'J and S(x,y) 



1 if x - y 
ifx £y 



(2) 



(3) 



This measure is very crude and lack many desired statistical 
features. For example, all paths that go via a particular neigh- 
bor of i will give a contribution to the sum. In practice this 
means that vertices with a high degree vertex rather far from 
itself (but closer that I) will trivially have a high S/(/)/A. A 
first step would thus be to omit the contribution of vertices oc- 
curring in many sequences of (2/(0 at a specific index. I.e., for 
all /' e (0, one wants to exclude the terms 



(4) 



where n and n' are indices of paths that are identical the first V 
steps, from Eq. 0- Let 5 ,(0 denote the number of such terms. 

To calculate 5 ,(0 consider a path P = (/,■••, j) of length 
/' < /. Let b/(P, i) be the number of paths from i of length I that 
start with the path P. We call b/{P, i) the branching number of 
P, see Fig. Eta). All pairs of paths starting with P will con- 
tribute to 5,(0 a distance /' from i (since they all pass through 
j). Let A(P, i) be the set of neighbors to j that is not on the 
path P from i to j, see Fig.|2b). (The number of elements in 
A(P, i) is thus kj - 1.) This situation gives a contribution 



(5) 



from vertices of indices in the interval [/',/] of Qi(f) to 5,(0, 
where (P, /) denotes the path (i, ■ ■ ■ ,j, /). 

To further improve the measure one would like to, assuming 
some null-model, subtract the expected random contribution 
to 5/(0/ A. If this can be achieved one would have a symmetry 



coefficient si(i) that is zero when the symmetry is what can 
be expected from the null-model, larger if i is a center of un- 
expectedly high symmetry, and less than zero if i is degree 
anti-symmetric. A final symmetry coefficient could thus be 
written 



5,(0 



5,(Q-5,(Q 
A -5,(0 



provided A > 5,(0 



(6) 



where v is the expected value of (5,(0 - 5,(0)/(A - Si(iJ) in 
a null-model. A = S ,(0 can only happen if there is one or no 
path of length I. In both these cases the degree-symmetry con- 
cept makes no sense so, if A = Si(i) e {0, 1), we set 5/(0 = 0. 
The null-model we assume is random constrained on the de- 
gree distribution of the network. I.e., given the fraction pk of 
fc-degree vertices the network is as random as possible. As it 
turns out v is tricky to calculate analytically. There are two 
ways to proceed — either one calculates an approximative v or 
one obtains v via averaging (5/(0 - S i(i))/(A - 5,(0) over re- 
alizations of the null-model. Except being more accurate, the 
latter approach has the advantage of giving an error estimate 
of s/(0 — one can by specifying a p-value define significantly 
symmetric, or anti-symmetric, vertices. We will use both ap- 
proaches: The approximative method for analyzing example 
networks and the numerical method for analyzing real-world 
data. 

We obtain an approximative value of v, v app ", by assuming v 
is approximately equal to the probability that a pair of vertices, 
reached by walking along paths, is the same. Note that, since 
there are k ways into a degree-A: vertex, when following a path 
the probability to reach a degree-fc vertex is 



kpk 



kpk 
<*>" 



(7) 



Thus the probability v app - that two vertices of the same degree 
is reached by following different paths is 



,/ipp. 



One reason this approach is not exact is that the number of 
terms in the expression for S,(/) increases with the degree of 
the j in A(P, i) of Eq. (|5J. There are other higher-order effects 
to related to other correlations between the path structure and 
the degree of the vertices. 

To summarize we have two measures of local vertex sym- 
metry, one approximative: 
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5/(0 - S ,(Q 

A -5,(0 (k) 



2 pI 



(9) 



and one based one Monte Carlo sampling 



5,(0-5/(0 / 5/(0 -5/(0 



A -5,(0 



A -5/(0 



(10) 



The sampling is conveniently done by random rewiring the 
edges of the original network l24l) . 
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III. ALGORITHM 

The heart of algorithm, as suggested in the previous section, 
is a depth-first search with depth I. When the returning along 
the traced out paths the branching number can be calculated 
recursively through 



b,(P,0 = 



1 if P has length I 

E/eA((P,/),0 b i(( p > A otherwise 



(11) 



S i(Pf) can be calculated simultaneously using Eq. (|3J- A slight 
complication is that the same vertex may appear in different 
branches of the depth first search while calculating b and s. 
For small cut-off values this is easy to handle: For I = 2 it 
does not affect the calculation at all. For / = 3 one would 
only have to keep different depths (of Eqs. (0 and (II Q ) sep- 
arate. For the calculation of S;(z) the terms of Qi(i) has to be 
stored. Since the number of paths p grows fast with /, this 
can be quite a constraint for a large /. Luckily it suffices to 
store a histogram h(l', k) counting the number of vertices of 
degree k at position V of the paths Qi{i). p (and thus A) can be 
calculated as the number of time the depth I of the depth first 
search is reached. The running time of the algorithm is O(p). 
A mean field approximation for networks with few triangles 
gives O(p) « 0((k)'). 



IV. EXTENSIONS AND CONSIDERATIONS 

The method outlined above can quite straightforwardly be 
extended to network with directed edges, distinct types of 
edges or (integer) edge weights. 

Imagine a network with z different edge sets E\,- ■ ■ ,E Z . 
Such networks frequently occur in cellular biochemistry — e.g. 
protein interaction networks where different types of protein 
interaction can be recorded ( fill) , or gene regulation networks 
where the edges can be activating or inhibitory. One sensible 
way to extend the above procedure is to use the union of the 
edges as your graph but to say two pairs of vertices in (2/(0 are 
identical if their degrees with respect to all of the networks are 
the same. To formalize this Qi(i) would be generalized to 



Q,(i) = {[k^y),---,^^)], 

.[k(v^),---,k(v^)]}, 



(12) 



where k(v) is a vector with v's degrees with respect to the 
different edge-types, and the (5-function of Eq. @ would be 
one if the arguments are equal at all their indices, and zero 
otherwise. The v app has to be redefined too: 
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FIG. 3 Degree symmetries of small example networks, (a) is con- 
sistent with the example Fig.Qa). (b) is an example of a graph with 
only positive degree symmetries, (c) shows a graph with only nega- 
tive degree symmetries. The cut-off length / = 2 is used. 



treated similarly — one consider paths following edges in both 
directions but a vertex pair gives a contribution to s only if 
both the in- and out-degrees are the same. 

The approach of Sect. [TO] can straightforwardly be applied 
to networks where multiple edges are allowed. Since multiple 
edges can be used to model weighted graphs ( 22) the general- 
ization to weighted graphs (at least where edge-weights repre- 
sent the probability of following an edge) is simple. The other 
aspect of multigraphs, self-edges, is trivially dealt with — by 
the requirement that a paths should not intersect themselves 
a self-edge will never be followed and can thus be omitted 
already when the graph is constructed. 

The overlap required for a vertex pair to be considered 
equal in the calculation of the symmetry coefficient is rather 
strict. Sometimes one would like to treat two paths as similar 
even if their degrees differs slightly. Particularly, this applies 
to broad degree distributions. The functional difference be- 
tween degree-2 and degree-3 vertices may be significant; but 
whether a vertex has degree 1002 or 1003 probably does not 
matter. To achieve such a relaxation one can construct a inte- 
ger sequence K\ < K%< •■• and let 



S(k, k') 



1 if Kj <k,k'< Ki+\ for some i 
otherwise 



(14) 



I.e., one construct a series of equivalence classes of vertices. 
For a power-law, or similarly broad, degree distributions one 
can let Ki+\ -Ki increase exponentially with i. In this case one 
also has to modify the definition of v app 



1 I )( 

<^Z Z w z 



\Ki<k<Kn 



kpk 



\K,<k<Ku 



(15) 



-ij J] k' Pk , k" Pk „ Y] J] PiikjW^kjW), (13) 
w v ,,,, i=1 j=l 



k',k" 



where pi(k\k') is the conditional probability that a vertex has 
degree k with respect to edge set E, given that its degree in the 
union network is k' . The case of a directed network can be 



V. DEGREE SYMMETRIES OF EXAMPLE NETWORKS 

In this section we evaluate the measure for example net- 
works and real-world networks. We will use the smallest non- 
trivial cut-off I = 2 throughout this section. Most conclusions 
hold for / = 3 or 4. 
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A. Small test graphs 

To get a feeling for the s/ measure we start by considering 
a few small test networks, see Fig. [3] In Fig. |3Ja) we dis- 
play a network with the same degree symmetry, with respect 
to the central vertex (triangle), as Fig.^ As expected the cen- 
tral vertex has a strong degree symmetry coefficient. To carry 
through the calculation of Eq. © once we obtain the degree 
distribution p2 = 8/13, p3 = 4/13 and p\ - 1/13 giving 
v app - = 165/832 * 0.198. All length-2 paths out from the 
central vertex have the degree sequence (3,2) so 52(A) = 4, 
5 2 (A) = 4 and A = 28 giving ^ (A) = 667/832 * 0.802. 
The degree-3 vertices (squares) have two degree sequences 
of their outgoing paths (4, 3) and (2, 2), whereas paths from 
degree-2 vertices (triangles) have degree sequences (3,4) and 
(2, 3). This difference is larger than expected from the null 
model (random networks with eight degree-2 vertices, four 
degree-3 vertices and one degree-4 vertex), thus the negative 
52 values for these vertices. 

In Fig. OJb) we show a graph where all vertices have pos- 
itive degree-symmetry coefficient. Paths from degree-2 ver- 
tices have only the degree sequence (3,2) and paths from 
degree-3 vertices have only the degree sequence (2, 3). Thus, 
for every vertex, the view of degrees along the path out to the 
rest of the network is the same no matter which direction one 
looks in from that vertex. A radically different view is seen in 
Fig-EJc)- In this case the vertices have three distinct positions 
in the network. The vertices marked with squares have degree 
two and four outgoing paths of degree sequences (2, 4), (4, 4), 
(4, 2) and (4, 2). The circles, despite their different network 
position (as being part of triangles), have the same set of de- 
gree sequences for their paths of length two. The degree-3 
vertices have six length-2 paths: three having the degree se- 
quence (2, 2), three having degree sequence (4, 2). It is easy 
to convince oneself that this close to as dissimilar a network 
with four degree-2 and two degree-4 vertices can be. Conse- 
quently all vertices have negative degree-symmetry indices. It 
is worth pointing out that the Fig. |3Jc) possesses other sym- 
metries than degree-symmetry. The layout has, for example, 
reflexive symmetry along a vertical axis. We emphasize that 
such symmetries would need to be captured by other mea- 
sures. 




FIG. 4 The average approximative symmetry coefficient for 1 = 3 
and random graphs with M = 2N. The line is a fit to a power-law 
decay form (0.124 + 0.435AT 102 , to be exact). 



C. Random graphs 

Next we evaluate the average approximative symmetry co- 
efficient (s app ) for random graphs ( 14) — graphs obtained by 
successively adding M edges between N vertices with the re- 
striction that no multiple edge, or self-edge, may occur. Such 
networks have no correlations at all and can serve as a ref- 
erence point for neutrality Ideally we would like such 
networks to, on average, have a degree-symmetry coefficient 
of zero. As seen in Fig.0](.sj lpp ') converge to a small but posi- 
tive value. The decay is roughly inversely proportional to N — 
the same scaling as the fraction of triangles in the network — 
which suggests that the presence of triangles, and perhaps 
other short-cycles, is an important source of finite size effects 
of sj lpp . We conclude that the Monte Carlo sampling measure 
(or a more elaborate measure) is needed if one wants to 
compare different networks. If, on the other hand, one aims 
to compare different vertices of the same network the faster 
s, (0 calculation is sufficient. This is not an uncommon situ- 
ation in the design of network measures. Another example of 
this where neutrality is non-zero in the large-Af limit is modu- 
larity, measuring how good a subgraphs that are densely con- 
nected within but not between each other (8). 



VI. DEGREE SYMMETRIES OF REAL NETWORKS 

In this section we apply our measures to real-world net- 
works. First we take a look at the symmetry coefficients of 
specific vertices in the metabolic network of humans, then we 
look at the average symmetry coefficients of various classes 
of networks. 



B. Regular networks 

If all vertices have the same degree a network is called reg- 
ular i 141) . Then by definition all paths are known to fully 
overlap. This trivial overlap should be canceled in our sym- 
metry measure so Si(i) = for all I and i. Since S /(/) is the 
number of terms in ?/(/) and all these terms are one we have 
S = 5/(0 = A. Furthermore, v app - = 1 which gives si(i) for 
all vertices and cut-off lengths. 



A. Human metabolic networks 

An important use of statistical graph theory is to charac- 
terize chemical reaction networks. Of many possible network 
representations 1 1291) we let vertices be chemical substances, 
and for all reactions of an organism we link substrates with 
products. For example, the hypothetical reaction A + B < — > 
C + D would contribute with the edges (A, C), (A,D) and 
(B, C), (B, D) to the metabolic network. The data is derived 
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their 2-neighborhoods are of appropriate sizes, neither too 
big, nor too small, to be displayed and described. Sper- 
mine, Fig. |5l a X is a substance with high degree-symmetry — 
s^ c = 0.89 + 0.02. Both its neighbors are in the same degree- 
equivalence class of vertices with degree four to seven. Of ver- 
tices two steps away from spermine there is also a significant 
overlap with two (out of four) neighbors to the neighbor sper- 
midine being in the equivalence class defined by degrees in the 
interval [8, 16); whereas two vertices are in the equivalence 
class of degrees in [4, 8). The three paths from spermine via S- 
adenosylmethioninamine also contribute to the overlap in the 
two steps from spermine as two vertices (methylthioadenosine 
and spermindine) have degrees in the same equivalence class. 
The neighborhood of C04850, seen in Fig. |5jb), is visually 
less balanced and also having a negative degree-symmetry — 
5 mc _ _qh ± o.Ol. We note that there are some vertex pairs 
in the second neighborhood whose degree-classes overlap, but 
apparently this is not enough to make the symmetry coefficient 
non-negative. 



B. Average symmetry values 



FIG. 5 The 2-neighborhood of spermine — a vertex with high 
degree- symmetry — (a), and C04850 — a vertex with low degree 
symmetry — (b), in the human metabolic network. The symbols indi- 
cate the equivalence classes defined by exponentially growing inter- 
vals. Filled circles have degree two, unfilled circles have degree four 
or five, a vertex symbolized by an rc-gon have degree in the interval 
[2",2" +1 ). In case the chemical names are overly long the KEGG 
codes are given ("C" and five digits): C07282 represents eIF5A- 
precursor-deoxyhypusine, C04850 represents l,3-/8-D-galactosyl-(a-- 
l,4-L-fucosyl)-N-acetyl-D-glucosaminyl-R, C04556 represents 4- 
amino-2-methyl-5-phosphomethylpyrimidine, C04467 represents a- 
L-fucosyl-l,2-y8-D-galactosyl-R and C01311 represents l,4-/?-D- 
galactosyHa'- 1 ,3-L-fucosyl)-N-acetyl-D-glucosaminyl-R. 

from the KEGG database (http://www.genome.jp/!, and 
described in detail in Ref. ( 13). Since the degree distributions 
of metabolic networks are highly skewed (16) we use a expo- 
nentially increasing set of intervals as equivalence classes (as 
discussed in the connect of Eq. H4\ ): K„ = 2". 

It has been argued that degree is strongly related to the func- 
tion of the chemical substance Q Il6l) . This means that the 
degree symmetry potentially can give additional information 
about the function of the vertices. For the human metabolic 
network, and / = 2, roughly half of the vertices have a p-value 
of less than 5% (i.e., in the null-model sampling of the calcu- 
lation of s^ c , less than 5% or more than 95% of the values of 



are smaller than the value of the real network). In Fig. |5ja) 
we show the 2-neighborhood of one vertex with signifi- 
cantly higher s™ c than expected; Fig. |5jb) depict the 2- 
neighborhood of a vertex with significantly higher s^ c . The 
reason these particular vertices are used as examples is that 



So far we have discussed degree symmetries of vertices. In 
this section we average s/ over V to obtain a graph-wide mea- 
sure for degree symmetry. In Table U we display values of 
s^ c for a number of different network types. Some of these 
have highly skewed degree distributions. For these, the expo- 
nentially increasing degree equivalence classes of Sect. IVI.AI 
are appropriate. Since we intend to compare all networks we 
use the same equivalence classes for all networks. The first 
observation is that almost all networks have a positive aver- 
age symmetry coefficient. The only clear exception is the 
airport network. This means that if you start a two-leg air- 
plane trip at a particular airport, choosing between two ran- 
dom itineraries (without caring about the frequency of flights), 
then the probability of the airports along these itineraries be- 
ing different in number of connections is smaller than in a ran- 
dom network. The strongest degree-symmetries are found in 
one-mode projections of social affiliation networks. Note that 
the other social networks, derived from questionnaires and 
electronic communication does not have such strong symme- 
try coefficients. In one-mode projections high-degree vertices 
are known to have strong tendency to attach to other high- 
degree vertices, and low-degree vertices to attach to other 
low-degree-vertices — so called assortative mixing I l20t) . If 
this property is strong there will be regions of vertices with 
high degree and other regions with low-degree vertices. The 
paths within these regions would also have similar degree se- 
quences. Thus high assortative mixing can be related to high 
degree symmetry, the first causing the second or vice versa. 
They are, of course, not equivalent — e.g., the example net- 
work with all vertices having positive symmetry coefficients 
(Fig. |2b)) is maximally disassortatively mixed (in the sense 
of Ref. (20)). Where the weak symmetry coefficients of other 
networks come from is outside the scope of this investigation. 
One possible explanation would be that functional units J2r3) 
might often be degree-symmetric centers. 
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TABLE I The network sizes N and M and the average numerical degree-symmetry coefficient s^ c of real-world networks. In the interstate 
network the vertices are American interstate highway junctions and two junctions are connected if there is a road with no junction in between. 
In the street networks the vertices are Swedish city-street segments connected if they share a junction. In the airport network (obtained from 
http://vlado.fmf.uni-lj.si/pub/networks/pajek/data/gphs.htm l the vertices are American airports and edges represent a regular, non-stop route. 
In the citation networks the vertices are papers and two papers are connected if they one cites the other. The "scientometrics" network consists 
of papers from the journal Scientometrics. The "small-world" network are all papers citing Ref. fl^) or having the phrase "small world" in the 
title. (The citation networks were obtained from http://vlado.fmf.uni-lj.si/pub/networks/data/cite/ These networks are the result of searches in 
the WebofScience used with the permission of ISI Philadelphia.) The board of directors and Ajou student networks are derived from one-mode 
projections of affiliation networks (where edges goes from persons to corporate boards and university classes respectively). The Ajou student 
network is averaged over graphs of 16 semesters. One edge represent two students taking at least three classes together that semester. The high 
school networks are gathered from questionnaires — an edge means that two persons have listed each other as acquaintances. It is averaged over 
84 individual schools. In the electronic communication networks one edge represent that at least one of the vertices has contacted the other 
over some electronic medium. The food webs are networks of water-living species and an edge means that one species prey on the other. For 
the protein networks an edge means that two proteins interact (the two graphs correspond to two different types of experiments determining 
the interaction edges). The metabolic networks consist of chemical substances and edges are constructed as described in Sect. IVI.Al Values 
for animal metabolism is averaged over six networks, fungi metabolism is averaged over two, and bacteria metabolism is averaged over 96 
networks. 





network 




Ref. 


N 


M 


r MC 


geographical networks 




interstate highways 




935 


1315 


0.016 ±0.003 






streets, Stockholm 


Q5) 


3325 


5100 


0.014 ±0.003 






streets, Malmo 


Q5J 


1868 


3026 


0.020 ± 0.003 






streets, Goteborg 


(25) 


1258 


1516 


0.026 ± 0.003 






airport 




332 


2126 


-0.0573 ± 0.0002 


citation networks 




scientometrics 




2728 


10398 


0.015 ±0.020 






small-world 




233 


994 


0.007 ± 0.002 


one-mode projections of 




board of directors 


(4J 


6193 


43074 


0.175 ±0.004 


affiliation networks 




Ajou University students 


(12) 


7285 ± 128 


75898 ± 6566 


0.13 ±0.01 


acquaintance networks 




high school friendship 


(2) 


571 ±43 


1104 ±60 


0.020 ± 0.002 


electronic communication networks 


e-mail 


ffi 


3186 


31856 


-0.01 ± 0.01 






Internet community 


(10) 


28295 


115335 


0.01898 ±0.0001 


food webs 




Little Rock lake 


(18) 


92 


960 


0.042 ±0.001 






Ythan estuary 




134 


593 


0.027 ± 0.002 


neural network 




C. elegans 


(28) 


280 


1973 


0.0839 ± 0.0001 


biochemical networks 




S. cervisiae protein 


QT; 23) 


4580 


7434 


0.0205 ± 0.0001 






S. cervisiae genetic 


Ql;23) 


4580 


5129 


0.0996 ± 0.0001 






animal metabolism 


(TJ) 


1621 ± 123 


4662 ± 473 


0.02 ± 0.01 






plant metabolism, A. thaliana 


(13) 


1561 


4302 


0.0133 ± 0.0003 






fungi metabolism 


(13) 


1281 ±97 


3654 ± 289 


0.03 ± 0.02 






bacteria metabolism 


(TJ) 


1070 ± 35 


2776 ± 109 


0.018 ± 0.002 



VII. SUMMARY AND CONCLUSIONS 

We have derived a measure for a specific notion of symme- 
try in networks — the property that the paths out from a vertex 
have overlapping degree sequences. The measure is designed 
so that random networks, conditioned only to have the same 
set of degrees as the original network, have the value zero. 
We propose two versions of the symmetry coefficient, the first 
being approximately zero for random networks, the second 
requiring a randomization procedure (and thus longer simu- 
lation time) but being more accurately zero for random net- 
works. The measure was evaluated on example graphs. We 
show that they are able to detect vertices in degree-symmetric, 
and potentially functionally meaningful positions in the hu- 
man metabolic network. The average degree-symmetry of 
various networks were also investigated. We found almost 
all networks having a weakly positive degree coefficient. The 
exceptions being the network of American airports and their 
interconnections (having a negative degree-symmetry coeffi- 



cient) and one-mode projections of social affiliation networks 
(having rather strongly positive values). Our measure is not 
the first to be based on a the properties of paths going out 
from a vertex. For example people have been using path 
counts for assessing the functional similarity of pairs of ver- 
tices In social network studies such measures are 
commonly called "ego-centric" d27h . 



Symmetry concepts have been successfully utilized in 
many field of physics. We believe degree symmetry, and other 
classes of network symmetries, will be a fruitful direction 
of future network studies. Degree symmetry is in particular, 
we believe, an important concept for networks where degree 
is strongly related to the function of the vertex. Two open 
questions from this study is what causes the rather ubiquitous 
weakly positive degree symmetries, and what process in the 
airline decision making that causes the negative average sym- 
metry coefficient of the airline network. 
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